Restless Poachers: Handling Exploration-Exploitation Tradeoffs in Security Domains
Abstract
The success of Stackelberg Security Games (SSGs) in counterterrorism domains has inspired researchers’ interest in applying game-theoretic models to other security domains with frequent interactions between defenders and attackers, e.g., wildlife protection. Previous research optimizes defenders’ strategies by modeling the problem as a repeated Stackelberg game, which captures the defining property of this domain: frequent interactions between defenders and attackers. However, this research fails to handle the exploration-exploitation tradeoff that arises because defenders only have knowledge of attack activities at the targets they protect. This paper addresses this shortcoming and makes the following contributions: (i) We formulate the problem as a restless multi-armed bandit (RMAB) model to address this challenge. (ii) To use the Whittle index policy to plan patrol strategies in the RMAB, we provide two sufficient conditions for indexability and an algorithm to numerically evaluate indexability. (iii) Given indexability, we propose a binary-search-based algorithm to find the Whittle index policy efficiently.
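To make contribution (iii) concrete, the following is a minimal sketch of how a Whittle index can be located by binary search over a passive-action subsidy. It is not the paper's algorithm: it assumes a small, fully observed, finite-state arm with discounted rewards, whereas the setting described above involves defenders who only observe attacks at protected targets. The function names, default bounds, and toy numbers are all hypothetical, and the search is only meaningful if the arm is indexable.

```python
from typing import List

Matrix = List[List[float]]


def q_values(P: List[Matrix], r: Matrix, beta: float, subsidy: float,
             tol: float = 1e-10, max_iters: int = 10000) -> Matrix:
    """Value iteration for a single arm; the passive action (0) earns an extra subsidy.

    P[a][s][t] is the transition probability s -> t under action a,
    r[s][a] is the immediate reward, beta is the discount factor.
    Returns Q[s][a] for a in (0 = passive, 1 = active).
    """
    n = len(r)
    V = [0.0] * n

    def backup(values: List[float]) -> Matrix:
        return [[r[s][a] + (subsidy if a == 0 else 0.0)
                 + beta * sum(P[a][s][t] * values[t] for t in range(n))
                 for a in (0, 1)] for s in range(n)]

    for _ in range(max_iters):
        new_V = [max(row) for row in backup(V)]
        converged = max(abs(x - y) for x, y in zip(new_V, V)) < tol
        V = new_V
        if converged:
            break
    return backup(V)


def whittle_index(P: List[Matrix], r: Matrix, beta: float, state: int,
                  lo: float = -10.0, hi: float = 10.0, eps: float = 1e-6) -> float:
    """Binary search for the subsidy at which passive and active tie in `state`.

    Assumes indexability and that the true index lies in [lo, hi]
    (both bounds are illustrative defaults, not derived from the model).
    """
    while hi - lo > eps:
        mid = (lo + hi) / 2.0
        Q = q_values(P, r, beta, mid)
        if Q[state][1] > Q[state][0]:
            lo = mid  # active still preferred: the subsidy is too small
        else:
            hi = mid  # passive preferred: the subsidy is large enough
    return (lo + hi) / 2.0


# Toy arm (hypothetical numbers): transitions do not depend on the action,
# so the index of each state reduces to r[s][1] - r[s][0].
same = [[0.7, 0.3], [0.4, 0.6]]
P_demo = [same, same]
r_demo = [[0.0, 0.3], [0.0, 1.0]]
idx0 = whittle_index(P_demo, r_demo, beta=0.9, state=0)
idx1 = whittle_index(P_demo, r_demo, beta=0.9, state=1)
```

In the toy arm the action does not affect transitions, so the computed indices should sit close to the reward gaps 0.3 and 1.0, which gives a quick sanity check. A Whittle index policy would then activate (patrol) the arms whose current states have the largest indices.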
Related resources
Balancing Exploration and Exploitation in Alliance Formation
Do firms balance exploration and exploitation in their alliance formation decisions and, if so, why and how? We argue that absorptive capacity and organizational inertia impose conflicting pressures for exploration and exploitation with respect to the value chain function of alliances, the attributes of partners, and partners’ network positions. Although path dependencies reinforce either explo...
Experimental Study on Boundary Constraint Handling in Particle Swarm Optimization: From Population Diversity Perspective
Premature convergence happens in Particle Swarm Optimization (PSO) for solving both multimodal problems and unimodal problems. With an improper boundary constraints handling method, particles may get “stuck in” the boundary. Premature convergence means that an algorithm has lost its ability of exploration. Population diversity is an effective way to monitor an algorithm’s ability of exploration...
Exploration and exploitation within and across intra-organisational domains and their reactions to firm-level failure
This study examines the evolution of exploration and exploitation within intra-organisational domains, specifically, the technological and market knowledge domains in high-technology firms. It simultaneously tests the interaction between exploration and exploitation across domains. Furthermore, this paper examines the impact of firm-level failure experience on exploration and exploitation withi...
Balance Within and Across Domains: The Performance Implications of Exploration and Exploitation in Alliances
Organizational research advocates that firms balance exploration and exploitation, yet it acknowledges inherent challenges in reconciling these opposing activities. To overcome these challenges, such research suggests that firms establish organizational separation between exploring and exploiting units or engage in temporal separation whereby they oscillate between exploration and exploitation ...
Unpacking the Exploration-Exploitation Tradeoff: A Synthesis of Human and Animal Literatures
Many decisions in the lives of animals and humans require a fine balance between the exploration of different options and the exploitation of their rewards. Do you buy the advertised car, or do you test-drive different models? Do you continue feeding from the current patch of flowers, or do you fly off to another one? Do you marry your current partner, or try your luck with someone else? The bal...